Probabilistic Anomaly Detection Method for Authorship Verification
نویسندگان
چکیده
Authorship verification is the task of determining if a given text is written by a candidate author or not. In this paper, we present a first study on using an anomaly detection method for the authorship verification task. We have considered a weakly supervised probabilistic model based on a multivariate Gaussian distribution. To evaluate the effectiveness of the proposed method, we conducted experiments on a classic French corpus. Our preliminary results show that the probabilistic method can achieve a high verification performance that can reach an F1 score of 85%. Thus, this method can be very valuable for authorship verification.
منابع مشابه
Une méthode non supervisée pour la vérification d'auteur à base d'un modèle gaussien multivarié
In this paper, we present a first study on using a distance-based outlier detection method for the authorship verification task. We have considered an unsupervised method based on a multivariate Gaussian model. To evaluate the effectiveness of the proposed method, we conducted experiments on a classic French corpus. Our preliminary results show that the proposed method can achieve a high verifi...
متن کاملA Trust Based Probabilistic Method for Efficient Correctness Verification in Database Outsourcing
Correctness verification of query results is a significant challenge in database outsourcing. Most of the proposed approaches impose high overhead, which makes them impractical in real scenarios. Probabilistic approaches are proposed in order to reduce the computation overhead pertaining to the verification process. In this paper, we use the notion of trust as the basis of our probabilistic app...
متن کاملProbabilistic Soft Error Detection Based on Anomaly Speculation
Microprocessors are becoming increasingly vulnerable to soft errors due to the current trends of semiconductor technology scaling. Traditional redundant multithreading architectures provide perfect fault tolerance by re-executing all the computations. However, such a full re-execution technique significantly increases the verification workload on the processor resources, resulting in severe per...
متن کاملAn Off-the-shelf Approach to Authorship Attribution
Authorship detection is a challenging task due to many design choices the user has to decide on. The performance highly depends on the right set of features, the amount of data, in-sample vs. out-of-sample settings, and profilevs. instance-based approaches. So far, the variety of combinations renders off-the-shelf methods for authorship detection inappropriate. We propose a novel and generally ...
متن کاملVerifying Multi-Agent Knowledge-Based Systems using COVERAGE
Anomaly detection, as performed by the COVER tool, has proven to be a useful method for verification of knowledge-based systems. The increasing development of distributed knowledge-based systems based upon the multi-agent architecture demands techniques for the verification of these systems. This paper describes the COVERAGE too1 -an extension of COVER designed to perform anomaly detection on m...
متن کامل